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1 Introduction 



1.1 Statement of problem 

In the introduction to his second volume, Knuth Knu97 discusses the computer genera- 
tion of pseudo-random numbers. He gives several cautionary tales about poor methods of 
generating these, including a function whose description is so complicated that it mimics 
iterations of a function chosen at random from all functions from {1, . . . , 10 10 } to itself. The 
exercises (see Exercises 11-15 on page 8 of |Knu97j ) then lead one through an analysis of a 
model where a function from [m] := {1, . . . , m} to itself is chosen uniformly at random. The 
poor performance of this pseudo-random number sequence is related to the cycle structure 
of a random map and is well understood. In particular, one may see readily that the average 
length of the cycle of numbers produced from a random seed is of order ^Jm and the cycle 
length from the best seed is not much longer. 

Knuth then proposes the following generalization Knu97, Problem 19, page 9, labeled 
M48]. A function / is chosen uniformly from among the m*-" 1 *^ functions from [m] k to [m]. 
Given an initial vector of values in [m] k for (X\, . . . ,Xk), an infinite sequence of values is 
produced by the rule 

X n +k = f {x n , . . . , X n+ k_i) . (1.1) 

The problem is to determine the average length of the period of this eventually periodic 
sequence if the initial k seeds are chosen at random, and to answer as well some related 
questions: what is the chance that the eventual period has length 1, what is the average 
maximum cycle length over all seeds, what is the chance that there is no seed giving a cycle 
of length 1, and what is the average number of distinct eventual cycles as the seed varies? 

1.2 Heuristic 

A thumbnail computation shows that one might expect equally poor performance from 
this multiple dependence model. Let W n G [m] k denote (X n , . . . , X n+ k_i) and let \i < r 
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be such that W T -k+i = W^-k+i but the values of W up to W T -k are distinct; thus the 
eventual period is r — fi and the length of the sequence of X values before repeating is 
t — k. Although the values {W n : n > 0} are no longer independent in the generalized 
model, one may hope that they are nearly independent, so that the value of the random 
quantity r is well approximated by the number ./V of IID uniform draws from a population 
of size M := m k needed to obtain the first repeated value. This is the classical "birthday 
problem" (see Example (3d) on page 33 and the discussion on page 49 of |Fel50| ) . It is 
known that the mean of ./V in the birthday problem is of order \[~M and more precisely 
that N 2 /2M converges in distribution to a mean-one exponential as M — > oo. One would 
therefore expect (Theorem 11.21 below) that r 2 /2m fc / 2 converges to a mean-one exponential 
as well. 

1.3 Background 

The problem of random number generation is of coure fundamental to the theory of com- 
puting. There are many classes of problems, including various factoring, counting and 
optimization problems, for which randomized algorithms give solutions much faster (on av- 
erage) then any known non-randomized algorithm. The use of randomization in practice, 
is if anything, more widespread than would be justified by theoretical results. Monte Carlo 
methods are ubiquitous in the areas of scientific computing, for example, and banks use 
vast tables of pre-generated random numbers to price derivative securities. 

The need for sources of effectively random numbers has mushroomed with the explosion 
in computational ability. Meanwhile, just as in the 1950's, the best sources are pseudo- 
random number generators, which are, for all their potential flaws, less prone to misbehave 
than are physical sources of randomness |Nie92j . 

To construct objects with random or chaotic properties, it is often easiest to pick them 
at random. Expander graphs, for example, are easy to construct at random but difficult 
to construct deterministically. When it comes to pseudo-random number generation, it is 
particularly appealing to use random or generic generators. The problem of finding a class 
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of generators, most of which generate good pseudo-random sequences, is therefore one of 
great interest to theorists and practitioners alike. 

The random unary map is a natural starting point for exploration of random pseudo- 
random sequences, but also underlies many other phenomena in probability theory, ranging 
from random trees to Brownian paths. For this reason it has been studied in great detail 
(see |Kol861 IAB821 |FT)901 IAP94j ) often with specific attention to the time before repetition, 
as in |AP94j . Its short cycle time, hence unsuitability for random number generation, has 
long been understood, as is evident from the discussion in all editions of Knu97 . This has 
led to a huge industry in random number generation (see, e.g., |JN ie9 2 _). Classes of functions 
on [m] or [m) k such as linear feedback registers or congruential sequences are sought whose 
periods are much longer than periods of functions chosen at random. Interestingly, the short 
cycle length can be a boon rather than always a drawback. Pollard's rho-algorithm |Pol75j 
relies on the y/m cycle length to find any prime factor p of n in time 0(y/p); Pollard's 
heuristic argument is in fact borne out empirically (see the discussion in SF96 ( pages 466- 
471]). 

Researchers studying random number generation via k-aiy maps appear to have taken 
for granted that, as in the unary case, these random maps cycle in a relatively short time. 
Settling this, however, was a problem stated already in the 1981 edition of Knu97 j , given 
a rating of [M48], and left unsolved. 

1.4 Results 

The purpose of this note is to show that the thumbnail computations are correct. All of 
the questions posed in Knuth may be correctly answered using the independence heuristic. 

In the case k = 2 the following result holds. Let m be given and let t, as defined above, 
be the least value for which W T _i repeats a previous value Wj for j < r — 1. 

2 

Theorem 1.1 As m — > oo ; the quantity converges in distribution to an exponential 
of mean 1. Furthermore, all moments of converge to moments of the exponential. In 
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particular, 

E r ~ m 
Var(T) ~ . 

For general k we will show: 

Theorem 1.2 For any /ixee? fc and x, dsm-> oo, 

r 2 

P( t > x) -> exp(-x) . 

Despite the fact that the arguments are straightforward, a careful analysis may be 
justified for several reasons. First, the question has gone unanswered for sufficiently long 
(and not for lack of interest) that the methods of analysis, though straightforward, must 
not be readily apparent. It will therefore be useful to introduce to the computer science 
community two techniques that are well known to probabilists and theoretical statisticians. 
In order to illustrate the range of available techniques for this kind of analysis, two different 
proofs will be presented. 

The first is a direct, combinatorial analysis and will be presented for the case k = 2 (as 
is stated in Problem 16 to be the first interesting generalization), though it can easily be 
generalized to larger k. It relies on the concept of hazard rate, well known in actuarial circles. 
A brief introduction to this concept is given in the next section. The second analysis uses 
the Poisson approximation machinery of AGG89 , which relies on some technical lemmas 
of |BE83j and concepts developed by Chen and Stein in the 1980's. This method is discussed 
in Section 3. Although the Chen-Stein method is not elementary, the present application 
of this machinery is straightforward. 

A second reason for undertaking this analysis is re-inject questions about basic random 
models into the stream of scientific discussion. Often a result on a basic model will rekindle 
interest in simple variations and lead to a branch of research previously overlooked by a 
community that tends towards depth-first research agendas. 
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Lastly, and perhaps most importantly, understanding the behavior of iterations of a 
random A;-ary map may prove useful for other random models. Just as short cycles of 
random unary maps have been used in Pollard's rho-algorithm and elsewhere, it is not 
hard to imagine the random k-ary map underlying various other structures, algorithms and 
heuristics. 

2 Time before repetition when k = 2 

Given any sequence X := {X\, X2, ■ ■ ■} of values in [m], we define the positive integer 
r = t(X) as above to be minimal so that W T —k+i = Wu-k+i f° r some k < ji < t (where Wj 
are sub-words of length k of the X vector, as in the introduction). When the X vector is 
random, we let J- n denote the <r-field cr{X\ , . . . , X n ). Compare the distributions of the vector 
(r, Xx, . . . , X T ) under two different measures for X: (a) when X satisfies the recursion (jl.lj) 
with Xq, . . . , Xk_\ IID uniform on [m] and (b) when X is an IID sequence of uniform draws 
from [m\. Under both (a) and (b), the conditional probability of X n+ \ = j given T n is 
1/m as long as r > n. The vector (r, X\ , . . . , X T ) therefore has the same distribution 
under either law on X. The main subject of our analysis it the distribution of r and other 
quantities measurable with respect to T T . We will therefore assume throughout that X is 
an infinite IID uniform sequence. 

Let m be given. In the remainder of this section, the dependence length, k, is fixed at 
two; thus t is minimal so that (X T —i,X T ) = (Xu-i,Xu) for some n < r. Arguments will 
be based on the hazard rate principle, an informal statement of this is the following. 

The exponential lieftime of mean 1 is charactarized by the property that at any given 
time t, if you haven't died yet, then you have chance dt of dying in the interval (t, t + dt). 
Now suppose we observe an individual and determine at every time t a chance fit) dt 
for him to die in the next dt. This will be by defintion a random variable measureable 
with respect to events up to time t (weight, smoking habits, and so on). It is no longer 
an easy matter to determine the unconditional lifetime distribution of the population 
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from typical information about the random function /, but we can say somthing under 
a random time change. If the lifetime always has a conditional future density, then the 
amount of hazard up to the time of death, defined by J„ T f(t) dt where T is the death 
time, will always be exponentially distributed with mean 1 (see |Jeu80l Prop. 3.28]). 
This principle is often used for bounding the tails P(T > x), e.g., by e~ c + P(/(x) < c). 

The proof of Theorem II. II is an elementary chain of asymptotic equalities. Letting £ (1) 
denote an exponential of mean 1, we will show that 

v t 2 

£(l) = hazard ~ linearized hazard ~ ^ . 

v ' 2m? 

The first equality states that in the present discrete-time context, one may still find a time 
change under which the lifetime is an exponential. The trick is to introduce auxilliary ran- 
domness to form the fractional part of the lifetime. This is Lemma 12 . 1 1 below . which is true 
for any stopping time. The remaining asymptotic inequalities then deal with approxima- 
tions introduced by the discrete time steps and by a small amount of unpredictability in 
the hazard rate (cf. the methods of Pcm96 ). 

Lemma 2.1 Let r > be a stopping time on a probability space (f2,P) with respect to a 
filtration {T n : n > 0} and let h(k) be the random variable defined by h(k) = — log(l — ^4^) 
on the event that r > k and arbitrarily otherwise, where 

A k := P(r = k + 1 1 F k ) . 

Let 

h = h* + Y,h(j) 

3=0 

where h* is a random variable, whose conditional distribution given h(r—l) = x is a mean 1 
exponential conditioned to be less than x. Suppose ^2h(j) = oo almost surely. Then h is 
distributed exactly as a mean 1 exponential. 
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Proof: Given x > 0, let G n be the event that YJj=l h U) < x < YTj=oKj)- Then 
G n G J^n-i and 

oo 

p(/j> x ) = £>(/i>x,G„) 

n=l 

oo 

71=1 
OO 

= ^El Gn P(/i>x|.F n 



-1/ 




-Mi) 1 P -(x~Z"Zo 



since £V h(j) = oo implies ^ n l(3 n = l. □ 

The remainder of the proof of Theorem 11.11 involves combinatorial specification of the 
hazard rate. Apply the hazard rate lemma to the quantity r in the statement of Theorem ll.il 
resulting in quantities h(k) and h satisfying h = £(1). Define Yj. to be the number of j < k 
for which Xj = X}~. An easy lemma is: 

Lemma 2.2 As m — * oo, m~ 1 / 2 m&x{Yk : k < r} — > in probability. 

Proof: Keep a tally of how many times each value has been seen in the sequence X\,X2, ■ ■ ■■ 
Since these are independent draws, it is evident that with probability 0(e _Ce ^") for some 
c e > 0, no value is taken on e^/rn times before every value is taken on e^fmjl times. At 
a time T e when every value has been taken on ey/rn/2 times, the hazard function h is at 
least c(e)m, where c(e) is a constant not depending on m. It follows for fixed e > that 
the probability of r > T e is exponentially small in m, and consequently that the probability 
of max{Yfc : k < r} exceeding €y/rri is at most the sum of two probabilities that are 
exponentially small in \/rn, and hence that it tends to zero as m — > oo. □ 
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Recast the definition of in terms of 
to obtain the following immediate consequence. 



t = k + ll^k) = — 

m 



Corollary 2.3 For every e > there is a c e > such that 

T-l 



h - £ h(j) 

3=0 



with probability at least 1 — c e 1 (eyLY> c t y/m) 



< em 



-1/2 



Proof: By the definition of h, 



T-l 



> h - Kj) > -Hr - 1) = log(l 

3=0 

-1/2 



m 



(2.1) 



By Lemma 12.21 this is at most em ' except on a set of measure tending to zero exponen- 

□ 



tially in y/m for each fixed e. 

The cumulative linearized hazard rate 



H(k) :=J2 A 3 

3=1 

is close to ^7=0 h(j) but easier to work with. We will see that 



j=0 



.„ (5) 

fe 2 • 



To quantify the last approximation, for j < m, let be the number of i < A; for which 

Xj = j. Then, counting pairs of occurrences of each value, an alternate definition of H (k) 
is: 

m (T k (j)\ 



77? 
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Lemma 2.4 If k 2 /m — > oo ; then 
in probability. 



Proof: Denote the first moment, second moment and variance of by fik,Sk and Vk 
respectively. We may compute these as follows. \ik = E( Tk ^). We compute Hk as the 
expected number of pairs (i,j) of indices at most k for which X; L = Xj = 1. Clearly then 



^ - „ 2 



m 2 



Compute m 2 Sfc as mETk(l) 2 + m(m — 1)^^(1)^(2). Counting ordered pairs of un- 
ordered pairs for which X u = X v = 1 and X w = X x = 2, we see that 

(5) ft 2 ) 



£T fc (l)T fc (2) 



m 
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In a similar way, allowing for (u;, cc) to have two, one or zero elements in common with 
(u, v), we get that 



Summing gives 

/fe\ (k - 2\ Mm - 1) m @2(fc-2) (*) 



Then 



and 



t/ _ c „2 _ (2) (2) 



V fc m - 1 



The lemma now follows from Chebyshev's inequality. □ 

Remark: In order to prove convergence of all moments, one must estimate EH(k) p for 
integers p > 2. There is an expansion analogous to the equation m 2 Sk = mETfc(l) 2 + 
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m{m — l)ETjfc(l)Tfc(2). Say that a descending vector of positive integers is a partition of m 
if A = (Ai, . . . , A^a) and Y^f=i = m - Let denote the product Yij=i Tk{j) Xj • Then 

m p EH(k) p = ^m #A (l + 0(m" 1 ))ET fc A 
A 

where the sum runs over partitions of m. Here, the multiplier m^ A (l + 0(m -1 )) gives the 
number of ways of choosing distinct ji, ■ ■ ■ , J#a- When A is the partition (1, . . . , 1), the 
leading term of the sum is 

m p — - k 

leading to a contribution of (1 + 0(m~ 1 ))(k 2 / (2m 2 )) p . For any other A, ET^ is a sum of 
terms of the form 0(k/m) a with 1 < a < p. Each of these terms appears in EH(k) p with 
the multiplier m* A_p , which is 0(m _1 ). The total number of these terms is bounded, so it 
follows that when k/m > e, 

EH(k) p = (l + 0(m~ 1 ))^J . (2.2) 

In other words, for k/m > e, 2m 2 Hj t /k 2 converges to 1 in each LP as m — ► oo, uniformly in 
k. 

Proof of Theorem II. II Convergence in distribution will follow from a comparison of 
H(k) and Ylj=o^U)- From the definitions, 

fcAr-l fcA(r-l) 

E h ® = E -iog(i-^fc) 

3=0 j=l 

fcA(-r-l) 

= e ^+o(^) 2 

3=1 



H(k A (r - 1)) 1 + O max A- 

Y?<*A(t-1) J 



= ff(fc A (r - 1)) ( 1 + O ( max -£ ] ) . (2.3) 
By Lemma l2~2l this shows that Ylj=i Mj)/-^( T — 1) — > 1 in probability as m — > oo. Since 
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T 2 /m — > oo in probability asm-> oo, Lemma 12.41 may be applied to show that 



9T7 2 



(2.4) 



j=0 



in probability as m —* oo. Together with Corollary 12.31 this implies that 



2m^ , 

— 5-/1 — * 1 

in probability as m — > 00, and convergence in distribution of r 2 /(2m?) to £ (1) then follows 
from the hazard rate lemma. 



To extend this to convergence of higher integral moments, argue as follows. We know 



that 



8(1) = h. 



Let || • ||p denote the LP norm. From Corollary 12.31 we see that 

h 



1 



(2.5) 



(2.6) 



as m — > 00. Let G be the event that maxjlfc : k < r} is at most m _1//2 . It was shown 
in the proof of Lemma 12.21 that the probability of G c decays exponentially in ^fm. It was 
already shown in 1)2 .3|) that 



1 + 0{m 



-1/2^ 



H(r - 1) 

on G, which, together with the decay of P(G C ) faster than any polynomial, leads to 

"^(r-l) 11 ^ 1 

as m -> co. Finally, the estimate 1)2.2)1 in the case k 2 /m > e together with convergence of 
T 2 /m to 00 in probability and monotonicity of H(k) in k imply that 

u H(r-l) 



(2.7) 



1 



(2.8) 



"r 2 /(2m 2 )" p 

as m — > 00. The chain p. 5)1 - 1)2.8)1 of asymptotic equivalences in LP proves the last statement 
of the theorem. □ 
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3 Analysis of X n+k = f(X n , . . . , X n+k ^ x ) for any k > 2 via Pois- 
son approximation 

In this section we will prove Theorem Ol Let N := [V2 m k x\. Then T 2 /(2m k ) > x if and 
only if the values of W n for < n < N are distinct. Let Z be the number of pairs (i, j) for 
which < i < j < N and Theorem ll.2l is an immediate consequence of: 

Lemma 3.1 The total variation distance between the law of Z and a Poisson of mean x is 
o(l) as m — » oo. 

The proof of Z is via the Chen-Stein Poisson Approximation method. The classical 
Poisson approximation result Dur96, page 137] says that if events are 

• independent, 

• each has probability at most e, which is going to zero, 

• and the sum of the probaiblities converges to A, 

then the number that occur converges in law to a Poisson of mean A. The Chen-Stein method 
is a means of weakening the independence assumption. It suffices that most events be 
independent, and the rest not too dependent. Stein Ste88j developed an abstract framework 
for quantifying such limit laws. Later authors such as |AGC89j provided useful hypotheses 
for getting numerical bounds on the distance, in total variation, to a Poisson distribution. 

Proof of Lemma 13. 11 For the duration of this proof, a and a' will be shorthand for 
and (i! respectively. Let S denote the set of a for which < i < j < N. Let G a denote 
the event that W f = Wj. Define p a = F(G a ) and p aa > = F(G a n G a >). 

Let B(a) be the set of a' for which \y — y'\ < k for some y G {i,j} and y' £ {i',j'}. 
Note that for a' ^ B(a), the event G a > that Wv = Wj' is measurable with respect to 
{X s : \s — i\, \s — j\ > k}. Therefore, 

G a is independent of a(G a i : a' ^ B(a)) . (3-1) 
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Define 

h := ^2 /2 PaPa' I (3-2) 

a a'eS(a) 

b 2 := ^2 ^2 Paa' • (3-3) 

The quantities &i and 62 are quantities appearing under the same name in AGG89, The- 
orem 1]; their quantity 63 is zero due to the independence relation Q3.1JI . The conclusion 
of |AGQ89[ Theorem 1] is that \F(Z = 0) - exp(EZ)| < 61 + b 2 . It remains to identify EZ 
and to bound 61 and b 2 from above. 

Observe first the claim that for any a, p a = m~ k . This is obvious for \i — j\ > k. But 
in fact for any i and j, G a occurs if and only if Xi +r = Xj +r for < r < k. For j = i + s 
with < s < k, the values of Xj, . . . ,Xj +s _i may be chosen arbitrarily, and there will be 
precisely one set of values of Xi +S , . . . , Xi +s+ k-\ for which G a occurs, proving the claim. 
It follows that 

X~EZ = Y^p a = m- k ( N 2 ] =(l + o(l))x. (3.4) 

Observe next that the cardinality of B(a) is at most 8£;./V, since the number of pairs 
(i',f) with i' within k of i is at most 2kN, and similarly for the other three possibilities. 
It follows immediately that 

h < C^2Pa)(8kN)m~ k < (8V2 + o(l))kx 3 / 2 m- k/2 . (3.5) 

a 

Finally, we bound b 2 from above. Let Bq{o) denote the set of a' for which both of i 1 
and j' are within k of either i or j. Then |5o(a)| < (4k) 2 . 

Claim: for a ^ a' £ Bq(o), 

Pa,a' < rtr k ~ l . 

Assume without loss of generality that j < j' , since the other cases, j > j', i < i' and i > i' 
are similar. Then 

< p a ¥(G a i I X n : n < f) < m k m 1 , 
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proving the claim. 



For ol ^ Bo(a), one conditions on {X s : \s — i\ < k or |s — j\ < k} to see that p c 



rn 



-2k 



One then has 



= ^2 X] Paa ' + X] Paa ' 

a a'£B (a) a a'£B (a) 

< ^ [lQk 2 m~ k - 1 + (8fciV)m~ 2fc ] 

a 

< Sk 2 N 2 m- k - 1 + 4kN 3/2 m- 2k 

= (1 + o(l))(16/fc 2 Am~ 1 + (42 3/4 A 3/2 /tm- 



(3.6) 



as m — > oo. Combining (|3.4j) - establishes that b\ + b<i = o(l) and EZ — x = o(l), 

which completes the proof of Theorem 11.21 □ 



4 Further discussion 

Let U and S{\) be independent with U uniform on [0, 1] and £ (1) exponential of mean 1. 
The following extension of the distributional convergence results may be proved. Recall 
that /i is the index for which Xu, . . . , X T _i is the first full period of the eventually periodic 
sequence of pseudo-random numbers. 

Theorem 4.1 As m ^ oo, the pair (fi, r) converges in distribution to (U£(l),£(l)). 

Complete proof of the extensions in this section will not be given, but the argument, 
along the lines of the first analysis, is as follows. Fix an integer r and break the hazard rate 
for the occurrence of r into r components. The j th component at time n is the hazard rate 
for the occurrence of r = n + 1 and (J — l)/r < fj, < j/r. A lemma analogous to Lemma l2.4l 
shows that the r hazards accumulate at asymptotically equal rates, and a lemma analogous 
to the hazard rate lemma then shows the asymptotic uniform distribution of /x/r over the 
r bins given \rr\ . Sending r to infinity completes the argument. 
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An analysis of the probability of landing in a cycle of length 1 is easiest along the lines of 
the Poisson approximation. Indeed, the number of occurrences of W n of the form (j, . . . , j) 
for some j < m by time k is well approximated by a Poisson of mean fem 1- *; the number 
of these followed by one more j is then nearly a Poisson of mean km~ k . Since r is of order 
m fc / 2 , one sees that the mean number of these occurrences by time r is 0(m _fc//2 ), so this 
gives the order of magnitude of the chance of being caught in a cycle of length 1. On the 
other hand, the probability that some seed results in a cycle of length 1 is the chance that 
one of the m words (j, . . . , j) maps to itself, which rapidly approaches 1 — e _1 asm-» oo. 

An upper bound on the maximum value of r over all seeds is obtained as follows. In the 
spirit of Theorems II .11 and II .21 the probability that t 2 > 2(1 + e)m k (k\ogm) can be shown 
to be close to exp(— (l+e)k log m). Indeed, while Theorems ll.ll and ll.2l as written, compute 
(2m k )x) only when x is fixed, the arguments are sufficient to handle poly-logarithmic 
growth of x, that is x < (logm) p . Specifically, the four chains in the asymptotic equalities 
when x grows at this rate are: the exact equality h = S(l) as before; the difference between 
h and Y^j-o ^C?) sman i n every L p \ the linearization error in replacing h by H changes the 
likelihood of exceeding a hazard of x from e~ x to e - x +°( x ) ^ an d the ratio between H(k) and 
its deterministic counterpart k 2 /(2m 2 ) is small as long as x is not too small (as before). 
One may then extend the estimate to slowly growing x: 

P(r 2 > 2(1 + e)km k log m) ~ m~ (1+e)fc . 

Since there are m k seeds, this gives 



P r* > y / bkm k logm (4.7) 
for any b > 2, where r* is the supremum over seeds of the value of r for a fixed random /. 

A interesting theoretical problem, perhaps do-able with only a modest amount of effort, 
would be to make this more precise and give a sharper estimate. For iterations of a unary 
map, the distribution of r* is known. Its mean was shown in |FO90l Theorem 7] to be 
asymptotic to an explicit contant multiple of \fm. Later the scaling limit of m^ 1 / 2 r* was 
shown to exist jAP94j and an explicit formula given |AP02I Theorem 1]. 
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Problem: Let T*(m,k) be the supremum over all seeds of the value of r for iterations of 
one random function f : [m] k — > [m]. Show that mT k l 2 T* converges weakly to the same 
limit as described in \AP94[ \AP02f . 

If this proves difficult, perhaps it could at least be shown that r* = 0(m k / 2 ) in proba- 
bility, that is, that the extra factor of log under the radical in Q4.7j) is superfluous. 

Acknowledgement: The author would like to thank Philippe Flajolet for bringing this 
problem to light at the 8 th Analysis of Algorithms meeting in Strobl. 
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